ON THE ZERO-TEMPERATURE LIMIT OF GIBBS STATES 



JEAN-RENE CHAZOTTES AND MICHAEL HOCHMAN 

O" 

' Abstract. We exhibit Lipschitz (and hence Holder) potentials on the 

full shift {0, 1} N such that the associated Gibbs measures fail to con- 
verge as the temperature goes to zero. Thus there are "exponentially 
r—{ • decaying" interactions on the configuration space {0, 1} Z for which the 

. zero-temperature limit of the associated Gibbs measures does not exist. 

^>0 ' In higher dimension, namely on the configuration space {0, 1} Z , d > 3, 

■ we show that this non-convergence behavior can occur for the equi- 

tIh . librium states of finite-range interactions, that is, for locally constant 

potentials. 

^— > . 

a . 

i— 1 1 1. Introduction 

| 1.1. Background. The central problem in equilibrium statistical mechan- 

7— i ' ics or thermodynamic formalism is the description of families of Gibbs states 

for a given interaction. Their members are parametrized by inverse temper- 
' ature, magnetic field, chemical potential, etc. The ultimate goal is then to 

r~~~ describe the set of Gibbs states as a function of these parameters. The zero 

' temperature limit is especially interesting since it is connected to "ground 

states" , that is, probability measures supported on configurations with min- 



imal specific energy 

^ ■ The purpose of this article is to shed some light on the zero-temperature 

limit in the case of classical lattice systems, that is, systems with a config- 
uration space of the form F^ d , where F is a finite set. We consider shift 
invariant, summable interactions <I> = (<£ b) Bcz d ,|B|<oo- For every j3 > 0, 
we denote by G((3$>) the (nonempty) set of Gibbs states of <I> at inverse 
temperature f3. It contains at least one shift-invariant Gibbs state [BJ. The 
question we are interested in is: 

What is the limiting behavior of Q(/3$) as (3 — >■ +oo ? 

When G{j3<&) is a singletom, we denote by fip$ its unique (and necessarily 
shift-invariant) element. Then the previous question becomes: 

Does the limit of exist, as f3 — >■ +oo? 
l 
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(Limits of measures should be understood in the weak-* sense). For the class 
of interactions we consider, shift-invariant Gibbs states are also equilibrium 
states. To define them, we need to introduce the function 



This function can be physically interpreted as the contribution of the lattice 
site to the energy in the configuration x. (0) 

Equilibrium states at inverse temperature f3 > are then shift-invariant 
measures which maximize the quantity 



over all shift-invariant probability measures v on F^ . Here h(v) is the 
Kolmogorov-Sinai entropy of u, and the supremum is called the (topologi- 
cal) pressure. When the lattice is one dimensional and the potential ip is 
Holder, there is a unique Gibbs measure which is also the unique equilib- 
rium measure. For d > 1 the notions generally are not equivalent and there 
may be multiple Gibbs states, even for finite-range interactions, but any 
shift-invariant Gibbs measure is an equilibrium state. 

As in |14| . we define zero-temperature equilibrium states as those shift- 
invariant probability measures which maximize § f ipdu among all shift- 
invariant measures v. It can be proven that the weak-* accumulation points 
of equilibrium states for a given interaction as (3 — > +oo are necessarily 
zero-temperature equilibrium states for that interaction. Zero-temperature 
equilibrium states are related to ground states (see [2] for details). 

1.2. The one-dimensional case. Let us make a few remarks about the 
ergodic perspective. Fix the usual metric 

d(-r iA = 0- max { fc : Xi=Vi V|i|<fc} 



on F . For a number of reasons, the usual class of "potentials" ip : F — > M. 
which are studied are Holder continuous ones. First, for these potentials 
the Gibbs measure /ig^ is unique for each (3 > (no phase transition). Sec- 
ond, this class of potentials arises naturally in the theory of differentiable 

^Since the interaction is shift-invariant, we can take any lattice site. Other definitions 
are possible |13l section 3.2], but all lead to the same expected value under a given shift- 
invariant measure. 

2 Or minimizes, depending on the sign convention for ip. 



B30 



(1.1) 
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dynamical systems (e.g. Axiom A diffeomorphisms): By choosing a suitable 
Markov partition of the phase space one can code such a diffeomorphism to 
a subshift of finite type in F z [I], and under this coding smooth potentials 
lift to Holder ones. And third, Holder potentials correspond to the natural 
objects in statistical mechanics, namely "exponentially decaying" interac- 
tions (<J?b) \13\ chapter 5]. We also note that the case when <p is locally 
constant corresponds to interactions of finite range; see below. 

There is a trick, due to Sinai, which allows one to reduce the study to a 
"one-sided" subshift of finite type of F N and a potential ip which depends 
only on "future" coordinates [T] . Thus it suffices to study the one-sided full 
shift, and our question can be formulated as follows: 

For Holder continuous p on F , when does lim iig^ exist? 

The existence of the zero-temperature limit has been verified in a number 
of situations, but, surprisingly, a systematic study of this question began 
only recently. When d = 1 and <p is locally constant (i.e. the interaction 
is finite range), the zero-temperature limit was proved to exist in [3] and 
was described explicitly in |10t H], In this case, the zero-temperature limit 
is supported on the union of finitely many transitive subshifts of finite type 
and is a convex combination of the entropy-maximizing measure on them. 
The case d = 1 with F a countable set was studied in O E]. 

Another class of examples where convergence may be verified arises as 
follows. Let X C F N be a subshift (a closed non-empty shift-invariant set) 
and define ip = ipx by 

P(y) = ~d(y, X) = - mf{e%, x) : x G X}. 

This is a Lipschitz function on F N with ip\x = and ip < 0. The ground 
states of ipx are then precisely the measures supported on X, and it follows 
that all accumulation points of {fJ-p v ,f3 > 0} are invariant measures sup- 
ported on A. In particular, when X has only one invariant measure [i (i.e. 
is uniquely ergodic), all accumulation points coincide, and we have — > /i 
as f3 — > +oo. 

The only example of non-convergence of which we are aware is by van 
Enter and Ruszel [15) . The example is of a nearest-neighbor potential model, 
but is defined over a continuous state space F (the circle). 
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This state of affairs has led to the belief that over finite state spaces 
convergence should generally hold. Our first result is a counterexample, 
showing that this is not the case: 

Theorem 1.1. There exist subshifts X C {0, 1} N so that, for the Lipschitz 
potential <fx(y) = —d(y,X), the family {[i/3 v ,/3 > 0} does not converge 
(weak-*) as (3 — > +oo. 

This theorem holds more generally for one-sided or two-sided mixing shifts 
of finite type. 

Our construction gives reasonable control over the dynamics of X and of 
the dynamics, number and geometry of the limit measures. An interesting 
consequence of the construction is that the set of limit measures need not 
be convex. We discuss these issues in section [H 

1.3. The multi-dimensional case. Our second result concerns higher di- 
mension: non-convergence can also arise when d > 3, even for finite-range 
interactions; contrast this with the positive result in dimension d = 1 where 
the zero temperature limit is known to exist in this case [10] H] . 

While the methods used in the one-dimensional case are fairly classi- 
cal and quite well-known in the dynamics community, the study of zero- 
temperature limits and ground states in higher dimensions turns out to be 
closely connected to symbolic dynamics, and our results rely heavily on re- 
cent progress in understanding of multidimensional subshifts of finite type, 
where computation theory plays a prominent role. Recall that a shift of finite 
type X C {0, l} z3 is a subshift defined by a finite set L of patterns and the 
condition that x G X if and only if no pattern from L appears in x. Given 
L C {0, 1} E one can define the finite-range interaction ($ b) Bcz d ,|B|<oo by 

1 otherwise 
and <1>b = for B / E; the associated potential on {0, l} zd is 

ip^x) — — $b(x) - [ 1 X ^ E G L 

B30 ^ I ^ otherwise. 

Clearly an invariant measure fi on {0, l} z<i satisfies J tpLd[i = if and only if 
(j, is supported on X; thus the shift-invariant ground states are precisely the 
shift invariant measures on X. In this sense (pi is similar to (fx (although 
there are some delicate differences, as we shall see in section [5]). 
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The main result of [7j provides a general method for transferring one- 
dimensional constructions to higher-dimensional SFTs with corresponding 
directional dynamics. Using this we are able to adapt the construction 
from theorem II .11 to the multidimensional case with a finite-range potential. 
Recall that an equilibrium state for a potential <p at inverse temperature f5 
is a shift-invariant measure which maximizes the functional Pg of Eq. (jl.ip . 

Theorem 1.2. For d > 3 there exist locally constant (i.e. finite-range) 
potentials <p on {0, 1} Z<1 such that for any family (// J g V3 ) / g>o in which [ip^ is an 
equilibrium state (i.e. a shift-invariant Gibbs state), the limit lim^+oo jjip^ 
does not exist. 

Some comments are in order because the previous statement is rather 
subtle. If there were a unique Gibbs state for each (5 then there would be 
a unique choice for fj,^, and the previous result could be formulated more 
transparently: there exist locally constant potentials such that lim^+oo fj,^ 
does not exist. But we believe that in our example uniqueness does not hold 
at low temperatures. 

A more precise way to state the previous theorem is to say that the set- valued 
sequence (^(/3<J>)) / g >0 does not converge in Hausdorff metric topology. 
Our result is about continuous families and does not contradict the fact that 
for each given family (fJ,p<p)p>o of equilibrium states, there always exists a 
subsequence (/3j)ieN such that the limit lim^oo np i{p exists. This is due to 
compactness of the space of probability measures. 

There is nothing new in the fact that one can choose some divergent family 
P ^ fJ-/3ip of equilibrium states. Think e.g. of the Ising model below the 
critical temperature {(3 large enough): One can choose a family which al- 
ternates between the + and — phases. However it is also possible to choose 
families which converge to one of the ground states. Let us insist that in 
contrast to this kind of situation we prove the existence of examples where 
it is not possible to choose any family which converges to a ground state. 
Let us say a few words about the limitations of this result. First, it seems 
likely that our examples support non-shift-invariant Gibbs states, i.e. Gibbs 
states which are not equilibrium states, and, furthermore, we do not know if 
the statement extends to them. Hence the requirement of shift-invar iance. 
As for the restriction d > 3, the method used in our construction, which 
produces a potential of the form (fL above, relies on the results from [7] 
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which at present are not available in d = 2; but probably they hold in that 
case as well. 

Problem. For d > 2, do there exist finite-range potentials on the d-lattice 
such that every family of Gibbs states {[ip§,(3 > 0} fails to converge as 
P -»• +oo? 

In the next section we construct the subshift X of theorem 11.11 Section 
[3] contains the analysis and proof of theorem 11.11 Section [4] contains some 
remarks and problems. Section [5] discusses the multidimensional case. 

Acknowledgement. We are grateful to Maria Isabel Cortez for pointing 
out a gap in an early version of this paper. We are also grateful to A. C. D. 
van Enter for useful comments. 



2. Construction of X 

For each k > we define by induction integers l k , and finite sets of blocks 
Ak,Bk C {0, lY k . The construction uses an auxiliary sequence of integers 
N%, N%, . . ., with Ni . . . Nk determining A4, B{,£i for i < k. Here we treat 
the Nk as given, but in fact at each stage we are free to choose based 
on the construction so far, and during the analysis in the next section we 
impose conditions on the relation between and Nk+i- 

Begin with £q = 5, and let 

A) = {00000,01000} 
B = {11111,10111}. 

Next, given A^_i, Bk_i and lk-i and the parameter N^, let be a block 
containing every block in (A^i U -B/c-1) 2 fc ~ 1+1 , e.g. enumerate all these 
blocks and concatenate them. 

We proceed in one of two ways, depending on whether k is odd or even. 
We denote by ab the concatenation of blocks a, b of symbols, and by a k the 
/c-fold concatenation of a block a. 

• If k is odd, let 

A k = {c k a Nk : a G A k _{\ 

B k = {c k bib 2 ■ ■ -b Nk : &j € 
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• If A; is even, set 

Ak = {ckaia 2 ...a Nk : ai G A k -i} 
B k = {c k b Nk : b G B k -i}. 

Thus ^4fc, -Bfc consist of blocks of the same length, which we denote Ik- Note 
that tk can be made arbitrarily large by increasing Nk- 

If we assume that Nk is large enough then one can identify the occur- 
rences of Ck in any long enough subword of length 2£k of a concatenation of 
blocks from Ak U Bk- This is shown by induction: first one shows that one 
can identify the Ak-i U i^_i-blocks, and then Ck is identifiable because it 
contains blocks from both Ak-i and Bk-\. 

For a set £ let S* denote the set of all concatenations of elements from a 
set S. Given a finite set L C {0, 1}* let 

(l) = Ut«(l n ) 

denote the subshift consisting of all shifts of concatenations of blocks from 
L. Note that if L' C L*, then (L') C (L). Let 

L fc = A k U Bfc, 

so that Lfe + i C L^, and define 

oo 

x = n <^> • 

fc=i 

Alternatively, X is the set of points x G {0, 1} N such that every finite block 
in x appears as a sub-block in a block from some 

3. Analysis of the zero-temperature limit 
We make some preliminary observations. For u G Lk let 
fi(u) = frequency of i in u. 
Then the following is clear from the construction: 

Lemma 3.1. If Nk/£k-i increases rapidly enough, then fo(u) > | for u G 
A k and f (u) <\foru£ B k . 

In fact it can be shown that X supports two ergodic measures, respectively 
giving mass > | and < | to the cylinder [0] . 
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The construction is designed so that the ratio l^l/ji^l fluctuates between 
very large and very small. More precisely, one may verify the following: 

Lemma 3.2. If N k /£ k _\ is sufficiently large, then 

• If k is odd then \B k \ > \A k \ 100 . 

• Ifkis even then \A k \ > |-B fe | 100 . 

The next two lemmas show that for certain values of (3 the measure [ip^ 
concentrates mostly on blocks from L k . Let 

Y k = {x G {0, 1} N : 2% ii+ 4_i] G L k for some i G [0,4 - !]}• 

Y k C {0, 1} N is an open and closed set. 

Lemma 3.3. For f3 = 2 3 ^, 

W<p{Y k ) > 1-2-'*. 
Proof. If x £ Y k , then we certainly have d(x,X) > 2~ 2lk . Therefore, 
J (3<pdnp v = J -/3d(y,X)dnp 9 (y) 

< -2 3 ^-2- 2 ^^({o,i} N \y fc ) 
= 2^(^(y fc )-i). 

Since h^p^) < 1 we have 

PfiM < j Pvdiip^ + 1 < 2?*n Plfi (y k ) - (2^ - 1). 

Finally, choosing v to be an invariant measure supported on X we have 
Pp(y) = h(v) > 0, hence Pp(np^) > Pp{v) > 0. Combining these we have 
the desired inequality. □ 

Lemma 3.4. For f3 = 2 3 ^ fc , for all large enough n at least half of the mass 
of fJ-ptp is concentrated on sequences u G {0, l} n which can be decomposed as 

(3.1) U = OVi O . . . V2 o ■ ■ ■ v m o 

where Vi G L k , the symbol o represents blocks o/0, l's (which may vary from 
place to place), and at least a (1 — 2~ ik ) -fraction of indices j G [0, n) lie in 
one of the Vi . 

Proof. Let 

F fe ' = {xG{0,l} N : x| [0 , 4 _i] G L fe }. 
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Since Y k = Ui^o 1 -^ *^fc' f rom the previous lemma and shift-invariance of 
fiptp, we see that 

Since (ip v is ergodic (being a Gibbs measure), by the ergodic theorem, for n 
large enough at least half the mass of is concentrated on points x £ X 
such that 

i#{t€[0,n-l] : T i xeY£}>Ul-2-*k). 
n l k 

Since the beginning of an L^-block is uniquely determined (because the c k 

blocks can be identified uniquely) we also have that if y £ Yl, then T l y ^ Y k ' 

for all 1 < i < lk- Thus if u is the initial n-segment of a point x as above, 

then there is a representation of u of the desired form. □ 

Next, we obtain a lower bound on P^(//g„): 

Lemma 3.5. If k is odd and j3 = 2~^ k then 

A similar statement holds for even k and A k . 

Proof. Let v be the entropy-maximizing measure on (B^). Since 

Pp(jJip v ) > Pp{») = hiy) - j fiipdv 

and h(y) = log j Bk ^ it suffices to show that 

(3.2) J ^(y)du(y) > -2^2^ . 

Indeed, if y G (B k ) then y = ab\02 ■ ■ ■ where hi G B k and a is the tail 
segment of a block in B k . Since, by construction, every concatenation of 
2 £ k _|_ i blocks from B k appears in X, it follows that the initial segment of 
y of length t k 2 lk appears in X, and therefore d(y,X) < 2~ f - k2tk , and (|3.2|) 
follows. □ 

The last component of the proof is to show that, for j3 = 2 3ik , the measures 
concentrate alternately B k and A k . This is essentially due to the fact 
that by the lemmas above, \xp^ is mostly supported on the blocks of and 
because of the appearance of entropy in the variational formula, it tends to 
give approximately equal mass to these blocks. Since |-Bfc|/|Lfc| — > 1 along 
the odd integers and |Afc|/|Lfc| —¥ 1 along the even ones, this implies that 
[ipp will alternately be supported mostly on B k and A k . 
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Here are the details. Denote by [u] the cylinder set defined by a block 
u G {0,1}*. 

Proposition 3.6. If N k increases sufficiently rapidly then for all 5 > and 

all sufficiently large k, if we set j3 k = 2~ 3ik then: if k is odd then 



(3-3) n hv { |J M) >l-8 

u£B k 

and if k even then 

Hf> hV ( U [«])>!-*■ 

u£A k 

Proof. We assume that Nk increases rapidly enough for the previous lemmas 
to hold and furthermore that, writing H(t) = —tlogt — (1 — t) log(l — t), 

H{2~ 1 *) 



log \B k \/£ 



and 



A' 



2 34 2 -4-2^ 











log\B k \/£ k 

as k — > oo along the odd integers, and similarly, with A k in place of B k , 
as fc — > oo along the even integers. This condition is easily satisfied by 
choosing N k large enough at each stage, since for fixed /c, as we increase N k 
the numerator decays to but the denominator does not. 

Under these hypotheses we establish the proposition for odd k, the case 
of even k being similar. Thus, we assume that \B k \ > |^4fc| 100 . Fix 5 > 
and suppose that (|3.3p fails for some k. For all large enough n lemma [37H 
implies that at least half the mass of te™ is concentrated on points whose 
initial n-segment is of the form (|3.ip . and, by the ergodic theorem and the 
assumed failure of (|3.3p . if n is large then with /^^-probability approaching 
1 the fraction of Uj's that belong to A k in the decomposition (|3.ip is at least 
5. 

For such an n we now perform a standard estimate to bound the entropy 
of fJ>/3 k (p- Applying e.g. Stirling's formula, the number of different ways the 
o's can appear in u is 



< (^j < 2 H(2 ~ lk)n . 



The positions of o's determines the positions of the Vi, and given this, the 
number of ways to fill in the so that at least a 5-fraction of them come 
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from A k is bounded from above by 

n/l k 

l4,J r lR,J n /4-r < 

4 



\A k \ r \B k \ n ^~ r < y ■ \A k \ Sn l^\B k \^- 6 >l"K 



Using the bound \A k \ < IB^ 1 / 100 and setting 

99 

5' = 5- — 
100 



we get 



< " . IB, |d-5>/4. 



Thus, for arbitrarily large n, half the mass of Hp kV> is concentrated on a set 
£fc C {0, l} n of cardinality 

\E k \ < 2 nH ( 2 ~ £fc )+ lo s n ~ lo s^ . 2 ( 1 -' 5 ')™iog|Bfcl/4. 
It follows from this and the Shannon-McMillan theorem that 



hence, since <p < 0, we have 

p h M ^ h M < (i - + ^(2- 4 ). 

Substituting the lower bound from lemma (|3.5p . we have 



lQg|jBfc| - 2 «*2-^-^* < (1 - <5Q l0g|J3fc| + tf(2^). 

By our assumptions about the growth of iVj. the inequality above is possible 
only for finitely many k. This completes the proof. □ 



We can now prove theorem 11.11 For 5 = choose the sequence N k so 
that the conclusion of the last proposition holds. Since the density of 0's in 
the blocks a 6 A k is > | and the density in the blocks b € B k is < |, it 
follows that for k large enough and (3 k = 2 _3 ^ fc , 

Hp k ([0]) < ^-5 if A; is odd 
2 

M/3fe ([0]) > — I" ^ if A; is even 
3 

Hence (yU ( a< /3 ) ( a>o does not weak-* converge. 
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4. Remarks 

4.1. Topological dynamics of X. In our example X is minimal. Indeed, 
any block a G appears in Ck+i and hence in every block in Lfc+i, so a 
appears in X with bounded gaps. Note that there are also minimal (non 
uniquely ergodic) systems X for which the zero-temperature limit exists. 

One can easily modify the construction to endow X with other dynamical 
properties, e.g. one can make X topologically mixing (our example is not, 
in fact it has a periodic factor of order 5). It is also simple to obtain positive 
entropy of X (and the limiting measures): form the product of the given 
example with a full shift. 

4.2. Measurable dynamics of the zero-temperature limits. In our 

example, (ftp^p^o has two ergodic accumulation points, and one can show 
that the convex combinations of these two are also accumulation points. 

In general, the set of accumulation points need not contain ergodic mea- 
sures, even when the zero-temperature limit exists. This is true even of lo- 
cally constant potentials [IQ1II]) and one can also construct examples which 
are simpler to analyze. For example, if X C {0, 1} N is a subshift invariant 
under involution f> 1 of {0, 1} N , and if X has precisely two invariant mea- 
sures fjf,fi" which are exchanged by this involution, then for the potential 
Vx(y) = -d(y, X) we will have lim^+oo fj,p v = \p! + \n" . 

The set of accumulation points also need not be convex. Using the 
same scheme as above one can construct a subshift X C {1,2, 3} z with 
three invariant measures = 1, 2, 3, by maintaining three sets of blocks 

Ak,Bk,Ck at each stage (rather than two). At each step of the construc- 
tion we choose the smallest of the sets and concatenate its blocks freely, but 
concatenate the blocks of the others in a constrained way, so that at the 
next stage the sizes of the selected set is much larger than the other two, 
which have not changed much in relative size. For each n there are always 
two sets (the two which are not growing very much at that stage) for which 
the number of re-blocks in one is much greater than in the other. Thus the 
Gibbs measures at the appropriate scale will have very small contributions 
from the smaller of these sets, and the accumulation points of /iRip will lie 
near the boundary of the simplex spanned by the //W (in our example there 
were only two sets and at each step one grew at the expense of the other; 
thus the relative number of re- blocks achieved all intermediate ratios). 
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Regarding the ergodic nature of the accumulation points, the same pe- 
riodicity of order five that obstructs topological mixing causes the ergodic 
invariant measures on X (i.e. the ergodic zero-temperature limits) to have 
e -27™/5 in their spectrum, but this can be avoided by introducing spacers into 
the construction. In this way one can make the limiting ergodic measures 
weak or strong mixing, and possibly K. 

Finally, we have the following variant of Radin's argument from [12]. Let 
fx be an ergodic probability measure for some measurable transformation 
of a Borel space, and h(/J,) < oo. By the Jewett-Krieger theorem [5] there 
is a subshift X on at most h(/i) + 1 symbols whose unique shift-invariant 
measure v is isomorphic to fi in the ergodic theory sense. For the potential 
ipx, all accumulation points of are invariant measures on X, so they all 
equal u; thus pLp v — > v as /3 — > +oo. This shows that the zero-temperature 
limit of Gibbs measures can have arbitrary isomorphism type, subject to 
the finite entropy constraint, and raises the analogous question for divergent 
potentials: 

Problem. Given arbitrary ergodic measures fi 1 , fi" of the same finite en- 
tropy, can one construct a Holder potential (p whose Gibbs measures 
have two ergodic accumulation points as j3 — > +oo, isomorphic respectively 
to fjf, //'? 

4.3. Maximization of marginal entropy. Let <p be a Holder potential 
and M the set of invariant probability measures \x for which J ipdfi is maxi- 
mal. It is known that if \x is an accumulation point of (^^^>) / g>o then \i G Ai 
and furthermore [i maximizes h(n) subject to this condition. 

In the example constructed above the potential ip had two 99-maximizing 
ergodic measures fj.',fi", and the key property that we utilized was that 
their marginals at certain scales had sufficiently different entropies. In fact, 
the measure maximizing the marginal entropy on {0, l} n for certain n was 
alternately very close to // and to fi". 

It is an interesting question if such a connection between zero-temperature 
convergence and marginal entropy exists in general. Let ip be a Holder 
potential, and for each n let .M* denote the set of marginal distributions 
produced by restricting [i G M. to {0, l} n . The entropy function H(-) is 
strictly concave on .A/f * , and therefore there is a unique /i* 6M* maximizing 
the entropy function. Let 

M n = {fl G M : /U|{ ,l}n = Mnl- 
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This is the set of (^-maximizing measures which maximize entropy on re- 
blocks. Note that the diameter of Ai n tends to as re —t- oo in any weak-* 
compatible metric. Hence we can interpret A4 n — > [i in the obvious way. 

Problem. Is the existence of a zero-temperature limit for ip equivalent to 
existence of lim.M n ? More generally, do (/x ( a ¥ ,) ( a>o and (A4 n )n>o have the 
same accumulation points? 

5. The multidimensional case 

In this section we apply the main theorem of [7J to obtain a locally con- 
stant potential (i.e. a finite-range interaction) in dimension d > 3 such 
that any associated family of equilibrium measures does not converge as 
{3 —7- +00. Our methods do not work in d = 2, because the results of [7] are 
not known in that case, but probably a more direct construction is possible. 

5.1. SFTs and their subdynamics. The metric on {0, 1}^ is defined bj{^ 

d(x,y) = 2 -min {H u ll - x ( u )^y( u )} 

where ||-|| is the sup-norm. We denote by T the shift action on {0, l} z<i and 
write Tx, . . . , for its generators. 
Let 

E n = {-n,...,0,,...,n} d 

denote the discrete d-dimensional cube of side 2re + 1. A subshift X is a shift 
of finite type (SFT) if there is an re and finite set of patterns L C {0, l} En 
such that 

X = {x £ {0, 1} Z : no pattern from L appears in x}. 

(Note: here L determines the forbidden patterns, which is the opposite of its 
role in (L).) A pattern a is said to be locally admissible if it does not contain 
any patterns from L; it is globally admissible if it appears in X, i.e. it can 
be extended to a configuration on all of Z d which does not contain patterns 
from L. These two notions are distinct, and it is formally impossible to 
decide in general, given L, whether a locally admissible word is globally 
admissible. 
If we write 



(5-1) <p L (v) 



"I V\En G L 

otherwise 



^The dimension of the ambient space is also denoted d but no confusion should arise. 
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then every invariant measure [i on {0, 1} Z satisfies J (fLd[i < with equality 
if and only if fi is supported on X. Thus for any SFT X there is a locally 
constant potential whose maximizing measures are precisely the invariant 
measures on X. 

Given a subshift X C {0, l} z<i , we may consider the restricted one- 
parameter action of T\ on X. We shall say that the topological dynamical 
system (X, T\ ) is a (one-dimensional) subaction of (X, T) . To each partition 
C = {Ci, . . . , C m } of {0, l} zd into closed and open sets we associate to each 
x € X its itinerary x c given by the action of T\ and the partition C, i.e. 
x i->- x G {1, ... , m} z is defined by 

x (i) = j if and only if T\x G Cj. 

The subshift 

X c = {x c : xeX}C{l,..., m} z 

is a factor, in the sense of topological dynamics, of the subaction (X, Ti). 

For a subshift Y C {0, 1} Z write L k (Y) C {0, l} fc for the set of fc-blocks 
appearing in F; note that for any sequence k(i) — > oo the sets i = 

1,2,.. ., determine Y. 

The main result of [7] says that the subaction of SFTs can be made to 
look like an arbitrary subshift, as long as the subshift is constructive in a 
certain formal sense. The version we need is the following: 

Theorem. Let A be an algorithm that for each i compute^ an integer n(i) 
and a set L{ C {0, 1, . . . ,r} n ® such that (Lj) 5 (-^i+i)- Then there is an 
alphabet X, an SFT X C S o/ entropy and a closed and open partition 
C = {Co,C\, . . . ,C r } of ' Yp i such that L n ^(X c ) = Li, and consequently 
X c = n (Lj). Furthermore, the partition elements Ci can be made invariant 
under the shifts T<i and T3. 

To apply this one usually begins with a subshift Y which has been con- 
structed in some explicit manner, and a computable sequence n{i) (e.g. 
n(i) = i), and derives an algorithm which from i computes L n ^\(Y); one 
then gets an SFT X and partition C so that X c = Y. This means that for 



A stronger statement can be made in which the computability is replaced with semi- 
computability of an appropriate family of blocks, and then one obtains (nearly) a charac- 
terization; but we do not need this here. 
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all practical nearly purposes (e.g. the construction of counterexamples) one 
can realize arbitrary dynamics as the subdynamics of an SFT@ 

From the result for dimension d = 3 the same is easily seen to hold for 
d > 3, but it is not known whether this holds in dimension d = 2. 

5.2. A modified one-dimensional example. For notational convenience, 
for the rest of the paper we concentrate on the case d = 3, the general case 
being similar. 

Realizing a specific subshift (such as the one from section [2]) as the sub- 
action of an SFT X does not in itself give good control over the equilibrium 
measures of <px ° r <Pl- Indeed, the size of L n (X c ) is exponential in n, which 
implies similar growth of the corresponding set L n (X), but does not guaran- 
tee exponential growth in n 3 , which is the appropriate scale for 3-dimensional 
subshifts. Thus for example we can have h(X c ) > but h{X) = 0. 

In order to use subactions to control entropy of the full Z 3 action we rely 
on a trick by which the frequency of symbols in X c can be used to control 
pattern counts in a certain extension of X. This approach was used in [8j [2]. 

We begin by modifying the main example of this paper so as to control 
frequencies rather than block counts. We define a sequence of integers l k and 
sets of blocks A k , B k C {0, 1, 2} £k by induction, using an auxiliary sequence 
N\, N2 ■ ■ ■ of integers. 

Start with l Q = 2 and A = {00,01}, B = {00,02}. Next, given k define 

A k = {« 1+2£fc - 1+ ^ : a e ^-1} 

and for k even define 

A k = {a^- 1 !^ :o€i4*_i} 
B k = {b 1 ^-^ : b € B k }. 

Let £ k be the common length of blocks in the sets above, i.e. l k = £ k -i(2 ik - 1+1 + 
N k ). Note that l 4 E A k and 2 £k £ B k . 

As k —7- 00 the frequency of 0's in the blocks of A k ,B k tends to 0, and 
the frequency of l's and 2's tends, respectively, to 1, and we can control the 
relative speed at which they do so. More precisely, there is a function N k (-) 



Nevertheless, one should bear in mind that the family of SFTs (and the set of algorithms) 
is countable. 
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such that given Ni, . . . , Nk-i and Np. > N k (Ni, . . . , Nk-i) we have 

f (a) > 100/o(6) for k odd, a £ A k ,b € B k 

f (b) > WOfo(a) for k even, a G A k , b € B k . 

(Recall that fo(x) is the frequency of the symbol in x.) 
Define 

00 

Y=f] {A k U B k ) . 

k=l 

Similarly define 

Yi = f](A k ) 

and 

Y 2 = f\(B k ). 

(notice that these are decreasing intersections) . Note that the only invariant 
measures on Y are the point masses at the fixed points I 00 G Y\ and 2°° G Y 2 . 
We denote 

(5.2) 4 = 4-1 ((IAk-I + \B k \) M * + N k (N 1 ... JV fc _j)) 

(so 4 > £' fc ) and note that as long as iV*. > N k (Ni, . . . , N k _i), the set L|/ (F) 
is in fact independent of N k and depends only on N\, . . . ,N k _i. We also 
note that N k can be computed explicitly, and in particular the function 
(k, Ni, . . . , -/Vfc_i) 1 y N k (Ni, . . . , N k ^i) is a formally computable function. 

5.3. Controlling pattern counts in a 3-dimensional SFT. We now 

incorporate the subshift Y constructed above into a 3-dimensional SFT and 
use the control over the frequency of symbols in Y to gain control of the 
pattern counts of an associated SFT. 

First, some notation: for a subshift X C S^ 3 write 

L n {X) = {x\ En :ia}CE £ " 

where E n = {— n, ...,n} 3 . This is the same notation we used for one- 
dimensional subshifts, but the meaning will be clear from the context. We 
remark that if the (topological) entropy of X is then |L n (X)| = o(|-E n |). 

Apply theorem 15.11 to Y (or, rather, to an algorithm that computes a 
sequence L n i k \{Y); we shall be more precise later about the algorithm used). 
We obtain a zero-entropy SFT X C and C = {C 0l d,C 2 } a T 2 ,T 3 - 
invariant partition so that X = Y . 
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Next, for x G X and u = (ux,U2,u%) G Z 3 , if a; (ui) = (i.e. if T" 1 ^ G 
Co) we "color" the site with one of the two colors 0',0". Otherwise we 
leave it "blank". Collect all such colorings into a new subshift X. Formally, 
X C X x {0',0",blank} z3 is defined by 

X = {(x,y) £ X x {0',0", blank} : y(u) = blank if x c (ux) ^ 0}. 

For x = (xi,X2) G X we also write instead of xf . One may verify that 
X is an SFT. We write £ = £ x {0', 0", blank} for the alphabet of X and 
write L for the finite set of patterns whose exclusion defines X. We may 
assume that if a pattern over £ is locally admissible for L then the pattern 
induced from its first component is locally admissible for L. 

Notice that, since Co,Ci,C2 are invariant under T 2 ,T^, the pattern of 
symbols 0', 0" in a point x G X is the union of affine planes whose direction 
is spanned by (0, 1, 0), (0, 0, 1). The sequence of coordinates at which these 
planes intersect the x-axis corresponds to the location of 0-s in x , and 
on each plane the symbols 0',0" are distributed as randomly as possible, 
i.e. given the arrangement of affine planes there is no restriction on the 
combinations of 0',0" that may appear in them. It follows that if a G 
{0, 1, 2}{- n >-' n > is a block in Y then 

#{(x,y)\ En : (x,y) G X and x c \ { _ n _ n} = a} = 2 M-)\ E n\+o(\E n \) 

(the term o(|£/ n |) comes from the pattern growth of X, which has entropy 
0). 

Write 

X x = {x G X : x c G Yx} 
X 2 = {x G X : x C G Y 2 }. 

Then, for k large enough, the frequency gap between blocks in and 
translates into 

|L 4 (l"i)| > \L ek (X 2 )\^ 10 fcodd 
\L tk (X 2 )\ > IL^Xx)^ 10 keven. 

Compare this with lemma [3721 

5.4. Local versus global admissibility. For <p = (p&, i.e. f(y) = —d(y, X), 
one can adapt the analysis in section [3] and show that does not have 
a limit as (5 — > +oo. Let us review this argument. Fix (3 = 2 _3 ^ fe , and set 
p = 1, 2 according to whether k is odd or even, and write q = 2 — p for the 
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other index. First, an in Lemma 13.51 we prove a lower bound on Ppdipy,) 
by constructing a measure Vk whose blocks (i.e. square patterns) are over- 
whelmingly drawn from Li k (X p ), making it nearly (^-maximizing, and with 
entropy close to j^—r\L£ k (X p )\. This forces the entropy of fip v to be similar. 
Second, we use the fact that most of the mass of /x^ concentrates on blocks 
from L>£ k (X) and the fact that L,£ k (X p ) 3> L( k (X q ) to deduce that in order 
for [ipy, to have entropy near r^—r\Li k (X p )\, it must be mostly concentrated 

on X p . This argument is similar to that in proposition 13.61 

We are now interested in proving the same thing for the potential ip-^ 
(given in (|5.1|) ) instead of ipg. The first part of the analysis above carries 
over with only minor modifications. 

However, the second part runs into difficulties. Notice that f (fj^dfi « 
implies that nearly all the fip^-mass is concentrated on patterns in Li k (X), 
but f (fTcLfJ, ~ tells us only that /U^-most blocks on Eg k are locally ad- 
missible for L; they do not have to be globally admissible, giving us little 
control of their structure. 

To pull things through, we will make use of the following observation: it 
is not necessary for us to know that most of the mass of jip^ concentrates 
on Li k (X). Instead, it suffices that it concentrates on Lgi (X), where £' is 
as in equation (|5.2|) . This is because (X p ) is already much larger than 
Lff (X q ), so we can argue as in the first part of the proof of proposition 13.61 

Thus, to complete the construction we want to ensure that if a block 
a G T, Ee k is locally admissible then o\e , is globally admissible, i.e. belongs 

to L e , k (X). 

A simple compactness argument establishes the following general fact: For 
any SFT and m G N there is an R so that if b G T, Er is locally admissible 
then b\E m is globally admissible. In general, however, R depends in a very 
complicated way on both the SFT and m, and in fact is not formally com- 
putable given these parameters. For our purposes we require finer control 
than this. Luckily, an inspection of the proof in [7] gives the following: 

Theorem 5.1. Let A be an algorithm that from i computes n(i) G N and 
Li C {0,1,... ,r} n W such that {Li) 5 (Lj+i). Denote by T{ the number of 
time-steps required for the computation on input i. Then the SFT X from 
theorem \5.1\ can be chosen so that, for Ri = Ri(\A\,Ti, . . . , tj) , if a £ T, ER i 
is locally admissible then ci\e n is globally admissible, and furthermore the 
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function Ri(. . .) is computable. Here r\ and A are taken with respect to some 
fixed universal Turing machine. 

5.5. Completing the construction: The fine print. We now specify 
an algorithm A which, given i, computes sequences n(i) G N and Li C 
{0, 1,2}™W so that (Li) D (Lj+i). The even elements n(2k) are the lengths 
4 associated to a sequence in the construction in section 15.21 i.e. 

_ n(2Jfc) - n(2k - 2) • 2 1+n ( 2fc ~ 2 ) 
k ~ n(2k - 2) 

The odd elements of the sequence are 

n(2k -l)=e k = h-iMNt, . . . , JV fc _!). 

Note that, having determined n(i), the blocks in K of length n(i) depend 
only on N±, . . . , N\ n uy 2 ] and not on any future choices of parameters of the 
construction. Hence Lj = L n u\{Y) is well defined given n(l), . . . ,n(i) and 
may be computed from this data. Thus at the i-th stage of the computation 
we will write L n u)(Y) even though strictly speaking Y is not yet defined. 
On input i the algorithm is as follows. 

Case 0: i = 1. Output 

n(l) = 1 
Li = {0,1,2}. 

Case 1: i = 2k — 1. Recursively compute JVi, . . . , N^-i, and output 
n{i) =4 = 4-l# fc (JVi,...,iV fc _i) 
^« = L n ty(Y). 

Case 2: i = 2k. Recursively compute N m ,m < k and the time n, . . . , rj_i 
spent by the algorithm when run on each of the inputs j = 1, . . . ,i — 
1. Let 

7V fc = (max{n(i-l),R(\A\,T 1 ,...,n- 1 )}) 2 
and output 

= 4 = N k £k-l 
Li = L n ^(Y). 

Realizing such an algorithm (which can simulate itself) is a non-trivial but 
standard exercise in computation theory. 
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We can now sketch the remainder of the proof of theorem ll.2[ Using A as 
input to theorem 15,11 we obtain an SFT X C £^ 3 and associated partition 
C = {C Q ,C 1 ,C 2 } of {0, l,2} z3 , invariant under T 2 ,T 3 , such that X c = Y. 
Next, form the SFT X as explained above, defined by a set L of excluded 
patterns. 

For (3 = 2 3£k let /j,^ be an equilibrium measure associated to the potential 
By the definition of equilibrium measures we have f Lpjd\x^ > — c2~ 3ik , 
where c = log |S| is the maximal entropy achieved by an invariant measures 
on the full shift ; in section [3] this constant was 1. Thus in a //g^-typical 
configuration the density of patterns from L is < c2~ 3ik . Hence for r = ^[t^ 
and large enough k, with ^^-probability > 1 — 2~ 2ik a configuration x 
satisfies that x\e t is globally admissible. By our choice of Ik = n(2k) we 
have r > R(\A\,Ni, . . . , N^), so x\E n(2k _ 1) is globally admissible. But since 
n(2k — 1) > £' k , we are in the situation described at the end of the previous 
subsection, and this is enough to conclude that [i^ is mostly concentrated 
on X\ or X 2 , depending on k mod 2; so [xp^ diverges along /3 = 2~ 3ik . 
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